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Abstract — We consider the problem of designing a fair schedul- 
ing algorithm for discrete-time constrained queuing networks. 
Each queue has dedicated exogenous packet arrivals. There 
are constraints on which queues can be served simultaneously. 
This model effectively describes important special instances like 
network switches, interference in wireless networks, bandwidth 
sharing for congestion control and traffic scheduling in road 
roundabouts. Fair scheduling is required because it provides 
isolation to different traffic flows; isolation makes the system 
more robust and enables providing quality of service. Existing 
work on fairness for constrained networks concentrates on flow 
based fairness. As a main result, we describe a notion of packet 
based fairness by establishing an analogy with the ranked election 
problem: packets are voters, schedules are candidates and each 
packet ranks the schedules based on its priorities. We then obtain 
a scheduling algorithm that achieves the described notion of 
fairness by drawing upon the seminal work of Goodman and 
Markowitz (1952). This yields the familiar Maximum Weight 
(MW) style algorithm. As another important result we prove that 
algorithm obtained is throughput optimal. There is no reason a 
priori why this should be true , and the proof requires non- 
traditional methods. 

Index Terms — Fair scheduling, ranked election, packet-based 
fairness, throughput optimality 



I. Introduction 

In this paper we focus on the problem of scheduling in 
constrained queuing networks. Specifically, we consider a 
collection of queues operating in discrete time with constraints 
on which queues may be served simultaneously. Such queuing 
systems serve as effective modeling tools for a large array 
of important practical problems, and their performance is 
crucially dependent on the effectiveness of the scheduling 
algorithm. 

In this setup the basic question is to design a scheduling al- 
gorithm that is optimal. There are several performance criteria, 
with often an inherent trade-off, that determine the optimality 
of a scheduling algorithm. The first is throughput optimality. 
A queuing system has a limited amount of resources. The 
natural constraints imposed result in an inherent limitation on 
the amount of traffic load that can be supported. This is called 
the capacity of the system. Roughly speaking, a scheduling 
algorithm that achieves the capacity utilizes system resources 
optimally. Such an algorithm is called throughput optimal. 

Apart from being throughput optimal, a scheduling algo- 
rithm should allocate resources in a fair manner. The queuing 
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system is a common resource shared by various traffic flows, 
and the scheduling algorithm should ensure that no flow is 
receiving more than its "fair" share of resources. It is important 
to realize that fairness in queuing systems is not only an 
intuitively desired goal but also one with an immense practical 
impact. 

A very important consequence of fairness is isolation of 
various traffic flows. Throughput optimality is oblivious to the 
identities of the different flows. But, identities are important 
for the following two important reasons: (1) Being oblivious 
to flow identities, throughput optimal algorithms often favor 
flows with a high data rate. Therefore, a particular flow might 
ill-behave and flood the system with a high data rate mali- 
ciously resulting in the deterioration of service to other flows. 
Since the system is a shared resource, the algorithm should 
identify the rogue flow and limit the negative impact on well- 
behaved flows. (2) Secondly, isolation is important to provide 
performance guarantees, and thereby Quality of Service (see 
Keshav (1997) [1]), to various flows in the system. Designing 
a scheduling algorithm that is fair will overcome these issues. 
Other benefits of fairness include reducing burstiness of flows, 
eliminating bottlenecks and reducing the impact of certain 
kinds of Denial-of-Service (DoS) attacks (see Bonald and 
Massoulie (2001) [2], and Yau et al (2005) [3]). In essence, 
a fair scheduling algorithm makes the queuing system robust 
and less prone to manipulation by individuals. 

A natural way to achieve isolation among flows, in order to 
provide protection and performance guarantees, is to dedicate 
resources to each of the flows. In fact, this is the approach 
taken in most of the work done on designing fair algorithms 
for input queued switches (details in Section I-B). This ap- 
proach, though, is limited for the following reasons: Firstly, 
because of constraints, it is not straightforward to determine 
the amount of resources to be allocated for each flow in a 
queuing network. Moreover, such determination would require 
the knowledge of flow arrival rates; whereas, in the spirit of 
being implementable, we require the scheduling algorithm to 
be online i.e., use only current network state information like 
queue-sizes, age of packets, etc., and myopic i.e., oblivious to 
flow arrival rates. Secondly, resource allocation takes place on 
an average over a long period of time. This is appropriate in 
a flow level model where the arrival statistics remain constant 
for long time periods. This assumption, though, is questionable 
in many applications like Internet traffic where short flows 
predominate. 

We note that designing a fair scheduling algorithm com- 
prises two sub-problems: defining a reasonable notion of fair- 
ness, and designing an algorithm to achieve the desired notion 
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of fairness. The notion of fairness is utilized to determine 
the resources (more specifically, the rate or bandwidth) to 
be allocated and the scheduling algorithm ensures that the 
resources are allocated on an average over a long period of 
time. Inspired by this, our approach would be to first define a 
notion of packet based fairness and then design a scheduling 
algorithm to achieve the defined notion of fairness. For obvious 
reasons, we also need to reconcile the benefits of fairness with 
achieving throughput optimality. 

Motivated by the above discussion, we attempt to achieve 
the following three goals in this paper: (1) Define a notion 
of packet based fairness. (2) Design an online and myopic 
algorithm, that is also throughput optimal, to achieve the 
notion of fairness. (3) Provide the throughput optimality proof 
of the algorithm. 

A. Our contribution 

The need for a packet based notion of fairness that can 
be used in a constrained network is clear. But, defining 
a precise mathematical notion of fairness that achieves the 
desired intuitive and practical benefits of fairness is quite 
challenging. Unfortunately, none of the existing notions of 
fairness directly extend to a packet based constrained queuing 
network. Existing notions of flow based fairness are based 
on the utility maximization framework (proposed by Kelly, 
Maullo and Tan (1998) [4]), which is a concept borrowed from 
Economics literature. In a similar spirit, we define a notion of 
fairness by establishing a novel analogy between scheduling in 
constrained queuing networks and a ranked election problem^. 
Ranked election is a widely studied problem in the Economics 
literature and this analogy provides a ready framework to 
leverage this work. We draw upon the work of Goodman and 
Markowitz (1952) [5] to obtain a unique characterization of the 
schedule. This, rather surprisingly, yields a maximum weight 
(MW) style algorithm. MW style algorithms are very popular 
in the literature and are very well understood. Thus, MW 
algorithms choose schedules in a "fair" manner, though the 
definition of "fair" for different weights is different. It should 
be noted that the choice of weights is crucial for obtaining 
the intuitive desirable properties of fairness and we make an 
important contribution here. 

As another important contribution, we prove that our algo- 
rithm is throughput optimal. There is no a priori reason for this 
to be true. Even though the algorithm we design is the familiar 
MW style algorithm, it is not queue size or waiting time 
based. Therefore the traditional methods of proving throughput 
optimality, which include the popular Lyapunov-Foster method 
and Fluid models, cannot be applied in a direct manner. The 
proof technique we introduce to prove throughput optimality 
is potentially applicable to a wider class of problems. 

B. Related work 

We first begin with the work on single queue fairness. 
Fair scheduling in single queues has been widely studied 

+A ranked election problem deals with choosing a winning permutation 
of candidates using a set of votes, where each vote is a permutation of the 
candidates. Refer to Section IV. 



since the early 1990s. In one of the earliest works, John 
Nagle (1987) [6] proposed a fair algorithm for single queues 
called "Fair Queuing." As mentioned earlier, fair scheduling 
is required to minimize starvation and limit the negative 
impact of rogue sources. In order to achieve this, Nagle 
proposed maintaining separate queues for different flows and 
serving them in a round-robin fashion. This is a great and 
simple to implement solution, but it works only when all 
the packets are of equal size. In order to overcome this 
problem Demers, Keshav and Shenker (1989) [7] proposed the 
notion of Weighted Fair Queueing (WFQ) and its packetized 
implementation. Parekh and Gallager (1993, 1994) [8], [9] 
analyzed the performance of this packetized implementation 
and showed it to be a good approximation of Generalized 
Processor Sharing (GPS). Shreedhar and Varghese (1996) [10] 
designed a computationally efficient version of weighted fair 
queuing called Deficit Weighted Round Robin (DWRR). Even 
though all these algorithms are fair, they are very complex and 
expensive to implement. Hence, there was a lot of work done 
on achieving approximate fairness for Internet routers through 
FIFO queuing and appropriate packet dropping mechanisms. 
Examples include RED by Floyd and Jacobson (1993) [11], 
CHoKe by Pan, Prabhakar and Psounis (2000) [12], and AFD 
by Pan, Prabhakar, Breslau and Shenker (2003) [13]. 

To address the issue of fairness in a network, Kelly, 
Maullo and Tan (1998) [4] proposed a flow-level model for 
the Internet. Under this model, the resource allocation that 
maximizes the global network utility provides a notion of fair 
rate allocation. We refer an interested reader to survey-style 
papers by Low (2003) [14] and Chiang et.al. (2006) [15] and 
the book by Srikant (2004) [16] for further details. We take 
a note of desirable throughput property of the dynamic flow- 
level resource allocation model (see for example, Bonald and 
Massoulie (2001) [2], and de Veciana, Konstantopoulos and 
Lee (2001) [17]). This approach, though valid for a general 
network with arbitrary topology, does not take scheduling 
constraints into account. 

We next review the work done on the design of fair schedul- 
ing algorithms for Input Queued (IQ) switches. Switches 
are the most simple - at the same time, highly non-trivial 
- examples of constrained networks. They form the core 
of Internet routers and there is extensive literature dealing 
with the design and analysis of various switch architectures 
and scheduling algorithms. A switch is essentially a bipartite 
network with input ports and output ports. The function of 
a network switch is to move packets from the input ports to 
the output ports using the switch fabric, just like the traffic 
at a traffic junction. Depending on the placement of buffers 
and the switch fabric, there are mainly two kinds of switch 
architectures - input queued switches (IQ) and output queued 
(OQ) switches. As their names suggest, input queued switches 
have buffers only at input ports, while output queued switches 
have buffers only at the output ports. The input queued switch 
has a cross-bar switch fabric that imposes the following natural 
constraints: only one packet can be moved from (to) each 
input (output) port in each time slot. On the other hand, since 
an output queued switch has buffers only at output ports, 
packets arriving at the input ports are immediately transferred 
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to their respective output buffers. Thus, there are no scheduling 
constraints at the switch fabric in an output queued switch. 
Because of this, the memory in an output queued switch has 
to operate much faster than the memory in an input queued 
switch. In most high-speed switches, memory bandwidth is 
the bottleneck and hence input queued switches are more 
popular and widely deployed, while output queued switches 
are idealized versions that are easy to study. 

It is clear from the description that scheduling in output 
queued switches is equivalent to that of single queues. Hence, 
fair scheduling in output queued switches just corresponds to 
implementing single queue fair algorithms at different output 
queues. Unfortunately, such extension is not possible for input 
queued switches because of the presence of constraints. One 
approach is to emulate the performance of an OQ switch by 
means of a IQ switch running with a minimal speedup. An 
IQ switch is said to be running with a speedup S, if it can be 
scheduled S times in each time slot. This approach was taken 
by Prabhakar and McKeown (1999) [18] and Chuang, Goel, 
McKeown and Prabhakar (1999) [19], where they showed 
that essentially a speedup of 2 is necessary and sufficient 
for emulating an OQ switch. With the OQ switch operating 
under any of the various policies like FIFO, WFQ, DWRR, 
strict priority, etc. fairness can be achieved. Equivalently, if an 
IQ switch is loaded upto 50% of its capacity and the notion 
of fairness is defined by policies like FIFO, WFQ, DWRR, 
strict priority, etc., then by emulating an OQ switch with these 
policies, it is possible to have fair scheduling for the IQ switch. 
However, for higher loading this approach will fail due to 
inability of emulating an OQ switch. 

This necessitates the need for defining an appropriate no- 
tion of fairness that cleverly and in a reasonable manner 
combines the preferences of packets based on some absolute 
notions along with the scheduling constraints. In principle, this 
question is very similar to the question answered by utility 
maximization based framework for bandwidth allocation in a 
flow network. In fact, most of the existing literature on fair 
scheduling algorithms for input-queued switches is concerned 
with the notion of flow-based fairness. In these approaches, 
a flow is identified with all the packets corresponding to an 
input-output pair. There are two main approaches taken in the 
literature for the design of fair algorithms for IQ switches. 
One class of fair algorithms implement a fair scheduling 
scheme at each of the servers in the switch and then carry 
out an iterative matching. This approach is based on the 
Distributed Packet Fair Queuing architecture. Examples of 
this approach include iPDRR proposed by Zhang and Bhuyan 
(2003) [20], MFIQ proposed by Li, Chen and Ansari (1998) 
[21], and iFS proposed by Ni and Bhuyan (2002) [22]. This 
approach completely ignores fairness issues that arise because 
of scheduling constraints and hence need not guarantee an 
overall fair bandwidth allocation. In order to overcome this, 
Hosaagrahara and Sethu (2005) [23] and more recently Pan 
and Yang (2007) [24] propose algorithms to calculate overall 
max-min rates of different flows, taking into account con- 
tention at all levels. But this approach requires knowledge of 
rates of flows and hence, the system should either learn these 
rates or know them a priori. 



Thus, most of the literature on the design of fair scheduling 
algorithms for constrained networks is limited in either ig- 
noring fairness issues caused due to scheduling constraints or 
directly borrowing flow-based fairness notions and allocating 
bandwidth accordingly. Here, it is important to emphasize the 
limitations of a flow-based approach: (a) network traffic pre- 
dominantly contains "short-flows", while flow-based approach 
requires existence of ever-lasting traffic thereby inducing huge 
delays when applied naively, (b) flow-based approach requires 
knowledge of traffic rates, which it may have to learn, (c) our 
unit of data is a packet and modeling it as a flow is just an 
approximation, and (d) packets have priorities and they lack 
explicit utility functions. 

In summary, our question is inherently combinatorial which 
requires dealing with hard combinatorial constraints unlike the 
resource allocation in a flow network which deals with soft 
capacity constraints in a continuous optimization setup. 

C. Organization 

The rest of the paper is organized as follows: Section II 
describes the model, introduces the notation and states the 
problem formally. Section III motivates and describes our 
approach. Section IV takes a digression into Political Science 
literature to explain the ranked election problem. Section V 
establishes the analogy between the ranked election problem 
and network scheduling. Sections VI and VII present the 
main results of this paper. Section VI formally states our 
algorithm; while Section VII provides the details of the proof 
of throughput optimality. We provide some simulation results 
in Section VIII and then finally conclude in Section IX. 

II. Model and Notation 

We now describe a generic abstract model of a constrained 
queuing network. The model corresponds to a single-hop net- 
work. This generic model describes important special instances 
like an input queued switch, wireless network limited by 
interference, congestion control in TCP or even traffic in a 
road junction. In each of these instances, the model effectively 
captures the constraints imposed by nature on simultaneous 
servicing of queues. We will describe the examples of an input 
queued switch and a wireless network in detail. We focus on 
these two examples because they encapsulate a large class of 
scheduling problems. 

A. Abstract formulation 

Consider a collection of TV queues. Time is discrete and 
is indexed by t 6 {0,1,...}. Each queue has a dedicated 
exogenous process of packet arrival. The arrival processes of 
different queues are independent. All packets are assumed to 
be normalized to unit length. Arrivals to each queue occur 
according to a Bernoulli process. 

The service to the queues is subject to scheduling con- 
straints, in that not all queues can be served simultaneously. 
The scheduling constraints present are described by a finite set 
of feasible schedules y C {0, 1}^. In each time slot a feasible 
schedule 7r e y is chosen and queue n is offered a service 
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7r n in that time slot. Since each packet is of unit length, when 
a non-empty queue receives service a packet departs from the 
queue. We will assume that 5? is monotone i.e., if 7r G S?, 
then for any a < n component-wise i.e., a n < ir n , a G 5? . 

Packets exit the system from any of the M output lines. The 
lines are assumed to operate at unit speed and hence at most 
one packet can leave the network from each line in each time 
slot. Each of the M output lines maintain buffers termed output 
queues to store packets that are ready to depart. We assume 
that routing is pre-determined and hence the destination output 
line of each packet is known. After service each packet is 
moved to its destination output queue. Each output queue 
operates using a single queue scheduling policy (eg. First 
Come First Serve (FCFS), Weighted Fair Queuing (WFQ) 
etc.). The served packets are queued and served according 
to the single queue scheduling policy. Fig. 1 illustrates this 
model. 

Under this setup, the problem of scheduling is to choose 
a feasible schedule in each time slot to serve the queues and 
move the packets to their respective output queues. Since the 
scheduling policy for each of the output queues can be chosen 
independently, the problem reduces to the of the constrained 
collection of queues. 




j. T „ , , M Output Queues 

A-Queuc Constrained ± ^ 

Queuing System 

Fig. 1. The abstract model of the constrained queuing system 

1) Notation: First, some general notation. K will denote 
the set of real numbers and K + the set of non-negative real 
numbers i.e., M + = {x G M : x > 0}. N will denote the set 
of natural numbers {1,2,...} and Z + the set of non-negative 
integers {0,1,2,...}. Let and 1 denote the vectors of 0s and 
Is respectively. All the vectors in this paper will be length N 
vectors. Let denote the indicator function, l{ true } = 1 
and l{fai se } = 0. x + denotes max{x,0} and we use the l\ 
norm \x\ = 2~2 n \ x n\- We also use the standard inner product 
(a,b) =J2 a i b i- 

Recall that we are assuming Bernoulli i.i.d arrivals. Let 
A n (r) G {0,1} denotes the number of arrivals to queue n, 
n = 1, 2, . . . , N, during time slot r. The arrival rate to queue 
n is denoted by A„ i.e., PrL4 n (r) = 1) = A„ Vr. A = (A n ) 
will denote the arrival rate vector. Q n {r) denotes the length 



of queue n at the beginning of time slot r. Q(t) = (Q n (j)) 
denotes the queue length vector. S{t) G 5? denotes the 
feasible schedule chosen in time slot t to serve the queues. 
Without loss of generality, we assume that a feasible schedule 
is chosen and service happens at the middle of the time slot, 
and exogenous arrivals occur at the end of the time slot. This 
is shown in Fig. 2. With Q n {r) denoting the queue length at 
the beginning of the time slot r, we have: 

Q»(r + 1) = (Qn(r) - S n (r)) + + A n (r) (1) 

Qn(T) A n (T) 



T 



5 'n(' r )l{Q„(T)>0} 
Fig. 2. The order in which service and arrivals happen during a time slot r 

Let D n (r) denote the cumulative departure process of queue 
n i.e., 

„(t)>o} ( 2 ) 

2) Definitions: We now introduce some definitions. We call 
a system rate stable, or simply stable in this paper, if the 
following holds with probability 1: for 1 < n < N, 

V D n( T ) . 

hm = A„ (3) 

r^oo T 

An arrival rate vector A = (A„) is called admissible if 3 
a scheduling policy under which the queuing network loaded 
with A has a queue size process Q„(r) such that 

limsupE[|Q(V)|] < oo (4) 

T 

Let A denote the set {A G : A is admissible}. A is 
called the throughput region or capacity region of the network. 
Tassiulas and Ephremides (1992) [25] proved that: 

rclint co(y) CAC co(y) (5) 

where co(S') denotes the convex hull of 5? i.e., 
G R£ : ii = Ei ow\ on > 0, tt* G S, on < 1}. 
relint co{y) denotes the relative interior of co{.y) and 
co(S^) denotes the closure of co(<9 7 ). Denote relint co(,y) by 
A'. An arrival vector A is called strictly admissible if A G A'. 
It was also shown by Tassiulas and Ephremides (1992) [25] 
that: 

A' = {^elf : J u = ^a J 7r l ; 

i 

a j >0,7r i e5,5^o i <l} (6) 
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3) Constraint free network (CFN): We now introduce the 
notion of a Constraint Free Network (CFN). A constraint 
free network is defined as a queuing network in which all 
the queues can be served simultaneously. Therefore, for a 
CFN y = {0,1}^ and J^ max = {1}. Thus, scheduling just 
entails moving arriving packets immediately to their respective 
destination output queues. 

As discussed in Related Work (Section I-B), fairness is well 
understood for single queues. Therefore, using a single queue 
fair scheduling scheme for each of the output queues yields 
a fair scheduling algorithm for the CFN. We will assume 
throughout that a CFN is operating using a fair scheduling 
algorithm. Along with a CFN we define a shadow CFN as 
follows: Given a constrained queuing network Af, a CFN 
A/ 7 with the same number of queues and fed with copies of 
exogenous arrivals to Af, is called the shadow CFN of Af. 

We conclude this section with a brief motivation for the 
definition of CFN. This will also serve as a preview to our 
approach to the problem. As mentioned earlier, the difficulty 
in designing a fair scheduling algorithm for networks arises 
because of the presence of constraints. In the absence of 
such constraints, the notion of fairness is equivalent to that 
of a single queue. Thus, we define an ideal network that is 
constraint-free, whose performance we want to emulate. This 
is in some sense the best we can do in providing fairness 
and thus it will serve as a benchmark for defining notions of 
fairness. 

B. Scheduling algorithms 

We consider the problem of designing scheduling algorithms 
for constrained queuing networks. A scheduling scheme or 
algorithm is a procedure whereby an appropriate feasible 
schedule is chosen in each time slot. In this paper, we will 
be interested in a class of scheduling algorithms termed the 
Maximum Weight (MW) Scheduling Algorithms. In general, 
a maximum weight algorithm works as follows: In each time 
slot t, each queue n is assigned a weight w„(r). This weight 
is usually - but not necessarily - a function of the queue size 
Q n (r). Then, the algorithm chooses the schedule with the 
maximum weight i.e., 

S(t) G argmax(w(r), n) (7) 

A feasible schedule tt e 5? is said to be maximal if 
Vcr e y , 7r ^ a component wise. The set of all maximal 
feasible schedules will be denoted by y ma . It is reasonable 
to assume that we want to serve as many queues as possible 
in each time slot. Therefore, when the algorithm chooses 
a feasible schedule 7r, we serve the queues according to a 
maximal schedule fi e ^ max suc h that fi > tt. 
Remark. An important special instance of the MW schedul- 
ing algorithm is the one with queue sizes as the weights 
i.e., w„(t) = Q n (r). In their seminal work, Tassiulas and 
Ephremides (1992) [25] (and independently McKeown et. al. 
(1996) [26]) showed that the MW algorithm with queue sizes 
as weights is rate stable. 

Since these results, there has been a significant work on 
designing high-performance, implementable packet scheduling 



algorithms that are derivatives of maximum weight scheduling, 
where weight is some function of queue-sizes. All of these 
algorithms are designed to optimize network utilization as 
well as minimize delay (for example, see recent work by 
Shah and Wischik (2006) [27]). However, these algorithms 
ignore the requirement of fairness. Specifically, it has been 
observed that the maximum weight based algorithm can lead to 
unwanted starvation or very unfair rate allocation when switch 
is overloaded (for example, see work by Kumar, Pan and Shah 
(2004) [28]). We provide a simple example of a switch to 
illustrate this: Consider a 2 x 2 switch with arrival rate matrix 
An = A21 = 0, A12 = 0.6, A 2 2 = 0.5. Here Ay corresponds 
to the arrival rate of traffic at input port i for output port j. 
Under this loading, output port 2 is overloaded. If OQ switch 
has Round Robin (or Fair) policy at output 2 so that traffic 
from both inputs is served equally, then input 1 will get rate 
0.5 and input 2 will get rate 0.5 from output 2. However, the 
maximum weight matching policy, with weight being queue- 
size (or for that matter any increasing continuous function 
of queue-size), the algorithm will try to equalize lengths of 
queues at both inputs. Therefore, input 1 will get service rate 
0.55 while input 2 will get service rate 0.45 from output 2. 

C. Input queued switch 

We now discuss an input queued switch as a special instance 
of the abstract model that we have described. As mentioned 
before, a switch is present at the core of an Internet router. 
A router moves packets from input ports to output ports. 
Based on the final destination of the arriving packet, a router 
determines the appropriate output port and then transfers the 
packet accordingly. The transfer of packets to the correspond- 
ing output ports is called switching. 

There are various switching architectures, but we discuss 
the one that is commercially the most popular. Consider an 
input queued switch containing N input ports and N output 
ports. Packets arriving for input port i and destined for output 
port j are stored at input port i in Qy. The switch transfers 
packets from input ports to output ports using the switching 
fabric. The crossbar switching fabric implemented in an input 
queued switch imposes the following constraints on packet 
transfer from input to output ports: in each time slot, each 
input port can transmit at most one packet and each output port 
can receive at most one packet. Therefore, feasible schedules 
are matchings from input to output ports. This is illustrated in 
Fig. 3. The left and right hand figures illustrate two different 
possible matchings. 



Input 1 



Input 2 



Input 3 



Output 1 Output 2 Output 3 

Fig. 3. A3 port Input queued switch showing two different possible 
matchings 
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The scheduling algorithm in the input queued switch 
chooses the appropriate matching in each time slot. To link 
back to the abstract model that we described, note that an N 
port switch has N 2 constrained queues; in the case of queues 
we will use the notation and not •„ for all terms to clearly 
reference the input and output ports. The set of all feasible 
schedules 5? corresponds to the set of all matchings in an 
N x N bipartite graph: 



y=U= e {0, 1}"*" : ]T 7r 4fc < 1; 

I fe=i 

N \ 

^7r fej -.< 1,1 <i,j<N\ (8) 
fe=l J 
Packets leave the switch from their respective output ports 
and hence the output ports correspond to the output queues. 
Since at most one packet arrives at each output port in each 
time slot, the packet immediately departs from the output 
queue. Thus, scheduling reduces to choosing an appropriate 
matching in each time slot. We point an interested reader to 
[19] for a more detailed exposition on switch architectures. 

D. Wireless networks 

We now consider wireless networks as a special instance 
of the abstract model. Consider a collection of devices (eg. 
sensor nodes, WiFi nodes, etc.) that are using the wireless 
medium to transmit messages. The devices share the same 
frequency to transmit and hence interfere when they transmit 
simultaneously. Because of power constraints, only devices 
that are close to each other geographically can communicate. 
This is often modeled by a graph with a node for each device 
and an edge between two nodes if they can communicate. 

Power constraints also limit interference to nodes that are 
close to each other; in other words, only nodes that are 
connected by an edge interfere. Therefore, the graph also 
models the interference constraints on scheduling. In other 
words, transmission to a node is successful only if none of 
its neighbors in the graph is transmitting at the same time. 
This model of interference is termed independent set model 
of interference. 

We assume that the network is modeled as a graph Q = 
(V, £) with V denoting the node set {1,2,..., N} and £ de- 
noting the directed edge set : i communicates with j}. 
Each node i maintains a queue Qij for each of its neighbors j. 
We will assume a single hop network in which packets arrive 
at nodes, get transmitted to one of the neighbors and then 
leave the network through output queues. Fig. 4 illustrates a 
wireless network with three nodes operating under interference 
constraints. 

In this setup, the scheduling problem is to decide which 
directed links will be active simultaneously. Constraints 
limit feasible schedules to those in which none of the 
neighbors of a receiver is transmitting; in other words, 
if link is active then none of the links in the set 

{(l,k) e £ : £ £ OR (j, I) G £} should be active. For 
each network represented by graph Q = (V, £), we can 




Fig. 4. A3 node wireless network 

construct a conflict graph Q' = (V, £' ) with a node for each 
of the directed links and an edge between two links if they 
cannot be active simultaneously. The feasible schedules then 
reduce to independent sets in the conflict graph. Formally, 

y = |tt e {0, 1} |V '' : 7T, + TTj < 1, for all e £'} (9) 

It should be noted that using the conflict graph, more general 
constraints in the network can be modeled as independent set 
constraints. Thus, the model we are considering encapsulates 
the essence of a large class of scheduling problems. 

III. Our approach 

Network resources are shared by different users and our goal 
is to design a scheduling algorithm that allocates resources 
in a fair manner. Before we can design such an algorithm, 
there is a need to give a precise definition of the users and 
the resources of the network. Traditionally, different traffic 
flows were considered the users and the bandwidth allocated 
to them the resource of the network. Link capacity constraints 
limited the total resource available and each flow was allocated 
its "fair" share of bandwidth. This is the basis of the utility 
maximization framework in which the utility of each flow 
was a function of the allocated bandwidth; the greater the 
bandwidth allocated the greater was the utility. Different 
utility functions yield different fairness criteria. An inherent 
limitation of this approach is that it considers entire flows as 
users, disregarding the fact that flows are not continuous but 
are composed of packets. Moreover, bandwidth is a resource 
that is allocated on an average over a long period of time 
assuming that flow statistics remain constant over such long 
periods. 

We overcome this limitation by treating the Head-of-Line 
(HoL) packet of each flow as the user of the network resources 
(We assume that each queue is associated with a flow and 
hence we use these terms interchangeably). This takes into 
account the packetized nature of a flow and is a consequence of 
the realization that in each time slot the decision is to whether 
serve the HoL packet or not, unlike the case of a continuous 
flow that can be given fractional service. Therefore, utilities 
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should correspond to HoL packets and not entire flows. With 
HoL packets as the users, the network resource becomes the 
service they receive in each time slot. The network resource 
is limited by the constraint set =5^ and the algorithm should 
choose a feasible schedule it 6 y in a manner that is "fair" 
to all the HoL packets. Inspired by the utility maximization 
framework, we could define utility functions for the HoL 
packets and choose a feasible schedule that maximizes the 
overall utility. But, there is no natural choice of the utility 
function and hence we take a different approach. 

We begin with the realization that packets do not have 
natural utility functions, but they do have a natural preference 
order of the feasible schedules. Inherently each packet prefers 
to be served. Therefore, a packet prefers all schedules that 
serve it (i.e., all schedules that serve the queue containing it) to 
all schedules that do not. Further, among those schedules that 
do serve it, the packet is indifferent. Similarly, it is indifferent 
to all the schedules that do not serve it. Fair scheduling now 
reduces to combining individual preferences in a fair manner 
to come up with a "socially" preferred schedule. This is 
equivalent to a ranked election problem: HoL packets (queues) 
are voters, schedules are candidates and each packet has a 
preference list of the schedules (refer to Section IV for more 
details on the ranked election problem). The question of ranked 
election is very well studied in the Economics literature (also 
called the theory of social choice). In their seminal work in 
1952, Goodman and Markowitz [5] prove that under certain 
socially desirable postulates (detailed in IV), when the voters 
have cardinal (quantitative) preferences over candidates, a 
simple function of those quantitative preferences will yield 
a uniquely preferred outcome. 

In order to use the Goodman and Markowitz result, we re- 
quire not just relative preferences, but quantitative preferences 
over the feasible schedules. By quantitative preferences we 
mean that each packet assigns numerical weights to schedules; 
the higher the weight the more preferred is the schedule. In 
principle the packet can assign any weights, consistent with its 
relative order of preferences, to obtain quantitative preferences 
over the schedules. But, it is important to realize that the 
choice of quantitative preference is crucial for obtaining the 
practically desired benefits of the fair scheduling. In our setup, 
each packet has two classes of schedules; one that it prefers 
to the other while being indifferent to schedules within the 
same class. Therefore, the packet assigns the same weight 
to all the schedules within the same class. We assume that 
each packets assigns the same weight, say *, to all schedules 
in the class it does not prefer. Now, assigning quantitative 
preferences reduces to choosing a weight for each packet to 
assign to the class of schedules it prefers. 

One feasible option would be to use queue sizes as weights. 
The problem with this choice is that it is oblivious to flow 
identities and is susceptible to manipulation (a flow can gain 
advantage by overloading system with packets resulting in 
large queue sizes). Another option would be to use the age 
(waiting time in the system) of the packet. This choice is still 
oblivious to flow and packet identities and it is difficult to 
provide QoS by giving priority to one flow (packet) over other. 
We overcome these problems by using the idea of emulation of 



the shadow CFN. As mentioned in Related Work (Section I-B), 
one way of designing a fair scheduling algorithm would be to 
perfectly emulate a CFN using a fair queuing policy at each 
of the output queues. But this results in a loss of throughput 
of the system. Therefore, our approach would be to emulate 
the shadow CFN as closely as possible. In this spirit, we use a 
function of the departure time of the packet from the shadow 
CFN as the weight; the earlier the departure time the higher 
the weight. The details of the exact function used are covered 
in Sections V and VI. 

We now tie this back to the utility maximization framework. 
Using the Goodman and Markowitz algorithm with the above 
choice of weights yields a MW style algorithm with the 
weight of each queue equal to the weight assigned to the 
packet. This is identical to the result we obtain by using 
the assigned weights as utilities of packets and choosing a 
schedule that maximizes overall utility. Therefore, our algo- 
rithm yields utility functions for packets that can be used 
in the utility maximization framework. This rather surprising 
result connects our approach back to utility maximization very 
nicely. 

We then establish that such an algorithm is throughput 
optimal under the standard stochastic model of a network. To 
prove throughput optimality (rate stability to be precise), we 
use an appropriate quadratic Lyapunov function. However, we 
are unable to use the standard stability proof technique based 
on Foster's criterion, as the Lyapunov function is not a function 
of queue-size, but is function of preferences derived from the 
shadow CFN. This makes the analysis rather non-trivial. 

To explain the consequences of our algorithm on fair emula- 
tion, we present simulations for algorithms based on FIFO OQ 
switch. Intuitively, our fair algorithm should be able to reduce 
the queue-size (or delay) as well as get rid of starvation caused 
by well-known throughput optimal algorithm. Our simulation 
results clearly confirm this intuition. 

IV. Ranked election 

In this section we take a digression into Political Science 
literature to describe the ranked election problem. 

Definition 1 (Ranked election): There are M voters that 
vote for C candidates. Vote of each voter consists of a ranking 
(or permutation) of all C candidates. These votes can addition- 
ally carry quantative values associated with their preferences. 
Let a mc denote the value voter m gives to candidate c, for 
1 < m < M, 1 < c < C. The goal of the election is 
to relative order all the C candiates as well as produce the 
ultimate winner in a manner that is consistent with the votes. 
The key for a good election lies in defining consistency of the 
outcome of election with votes. The following are canonical 
postulates that are used in the literature on ranked election: 
PI. Between any two candidates c and c', suppose that none 
of the M voters prefers c' to c and at least one voter 
prefers c to c'. Then c' should not be ranked higher than 
c in the output of the election. This property corresponds 
to the economic notion of weak Pareto optimality. 
P2. Suppose the voters are renumbered (or renamed) while 
keeping their votes the same. Then the outcome of 
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election should remain the same. In other words, the 
election outcome is blind to the identity of the voters, 
that is election outcome is symmetric. 
P3. Now, consider the setup when the votes are cardinal (i.e., 
quantitative). Suppose candidate c is preferred to c' by 
the election. Then, by adding the same fixed constant 
to all a mc and a mc > for 1 < m < M, the relative 
order of candidates c and d should not change. This 
makes sense because what matters is the difference in 
preference levels for the two candidates, not the actual 
values. 

In the absence of cardinal (or quantitative) preferences, the 
question of ranked election with postulates PI, P2 (and some 
additional postulates) was first studied by Arrow (1951) [29]. 
In his celebrated work, he established the (then) very surpris- 
ing impossibility of the existence of any election scheme that 
satisfies PI, P2 (and additional postulates) simultaneously. We 
note that this result has been an important corner stone in the 
field of theory of social choice. 

Subsequent to Arrow's impossibility result, many 
economists started looking for positive results. Among 
many other celebrated results, the result that is relevant to 
this paper is that of Goodman and Markowitz (1952) [5]. 
They showed that if voters have cardinal preferences, as in 
our setup, then there is a unique ordering of candidates that 
satisfies P1-P2-P3 simultaneously. To describe their result, 
consider the following: let the net score of a candidate c 
be s c = J2m=i a mc- Goodman and Markowitz obtained the 
following remarkable result. 

Theorem 1: Suppose the scores of all candidates are dis- 
tinct. Rank candidates as follows: candidate c has higher 
ranking than d if and only if s c > s c >. This ranking satisfies 
postulates P1-P2-P3. Further, this is the only such ranking. 
For a proof of this result, we refer a reader to [5]. 

V. Analogy Between Fair Scheduling and Ranked 
Election 

We will now formally establish the equivalence between 
fair scheduling in queuing systems and the problem of ranked 
election. In our context, the packets (queues) are the voters and 
the feasible schedules it G y are the candidates. In order to 
use the Goodman and Markowitz setup, we require quantitative 
preferences of packets over feasible schedules. 

As mentioned earlier, each packet inherently prefers being 
served and hence prefers all schedules that serve it to all 
schedules that do not. Therefore, for each packet there are two 
classes of schedules - one class containing all the schedules 
that serve it and the other containing all the schedules that do 
not. Moreover, the packet is indifferent to all the schedules 
within the same class. Since it is only the relative quantitative 
preferences that matter, we assume that each queue n assigns 
the same value, say *, to all the schedules that do not serve it. 
It now only remains to fix the value that each queue assigns 
to the class of schedules that serve it. 

For that, we will use a shadow CFN. As defined before, 
a copy of every packet arriving to the network Af is fed to 
the shadow CFN A/"'. That is, (copy of) a packet arriving at 



queue n for output queue m of Af immediately joins the output 
queue m in A/ 7 . The departures from the output queues of Af' 
happen according to an appropriate fair scheduling policy, say 
V, such as strict priority scheme, last-in-first-out or simply 
first-in-first-out. Let d„ be the departure time of k th packet 
arrived at queue n from Af' . Clearly, these departure times are 
dependent on the policy V used at the queues. If emulation 
was possible, the network Af would send out the packets at 
exactly the same times as (c^)„,fc. However, at speedup 1 it is 
not possible to emulate Af' perfectly as shown by Chuang et. 
al. [19]. Thus, the whole challenge is to respect the scheduling 
constraints of the network, while letting packets depart from 
the network so that they are as faithful to the departure times 
obtained from shadow CFN as possible. 

Based on the above discussion, our approach would be to 
use a value that is a function of the departure time of the packet 
from Af'; the earlier the departure time the higher is the value 
assigned. More specifically, let ip T n denote the HoL packet in 
queue n of network Af. Let d n (r) denote the departure time 
of from Af'. For the following discussion we assume that 
the queue is non-empty and hence d n (r) is well defined. We 
defer the discussion of empty queues to the next section. Now, 
each queue n assigns a value of — d n (r) to all the schedules 
that serve it. (The choice of — d n (r) seems arbitrary, when we 
could have taken any decreasing function of d n (r). Indeed we 
can, though it should have some "nice" properties to maximize 
throughput. Details are in Section VI). This completes the 
equivalence. 

Now, with the above assignment of values to each schedule 
by each queue, the value of a schedule 7r 6 5? is given as: 

N 

value(Tr) = - ]T 7r„d„(r) + {2 N - N)* 

n—1 

= {-d(T),TT)+(2 N -N)* 

The postulates P1-P2-P3 translate into the following postu- 
lates for network scheduling. 

PI'. Between any two schedules m and 7r 2 , suppose that 
none of the N HoL packets prefer 7r 2 to 7Ti and at least 
one HoL packet prefers 7Ti to 7r 2 . Then, we should not 
choose 7r 2 . 

P2'. For given HoL packets, let ir be the outcome of the 
election as per the above preferences for schedules. 
Then, by renumbering queues while retaining the same 
HoL preferences, the outcome of election should be only 
renumbered w. In other words, the election does not give 
unfair priority to any port and thus is symmetric in its 
inputs. 

P3'. Suppose schedule m is preferred to 7r 2 by the election. 
By adding the same fixed constant to — d n {r) for all n, 
the outcome of the election should remain unchanged. 
The election algorithm of Goodman and Markowitz suggests 
that the following schedule S(t) should be chosen: 

S(t) G argmax(— d(r), ir) 
Thus, the algorithm is the maximum weight schedule (MWS) 
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where weight is given by — d n (r) at time r. With this, we now 
formally state the algorithm. 

VI. Most Urgent Cell First (MUCF(/)) Algorithm 

We now describe the MUCF(/) scheduling algorithm for 
a constrained queuing system Af. First, some notation. Af' 
will denote the shadow CFN of Af, and p T n will denote 
the Head-of-Line (HoL) packet of queue n in Af at the 
beginning of the time slot r. Let a„(r) and d n (r) denote 
respectively the arrival and departure times of p T n in Af'. 
f : R — > M denotes a non-decreasing bi-Lipschitz continuous 
function with Lipschitz constant p > 1 i.e., for any x, y G R, 
1 /p\x— y\ < \f(x)—f(y)\ < p\x-y\. Without loss of generality 
we will assume that /(0) = 0. Thus, f(x) > for x > 0. In 
each time slot r define the urgency of a non-empty queue n as 
r — d n (r). This will be denoted by U n (r). The urgency of an 
empty queue is defined as — max{0, — mirin.Q^jv^o J7„(r)}. 
Finally, U[(t) denotes / (U n {r)). 

The scheduling algorithm for the constrained queuing sys- 
tem comprises queuing policies at each of the N constrained 
queues and M output queues, and choosing a feasible schedule 
of servers from 5? '. The three components of the algorithm are 
as follows: 

1. The arriving packets are queued according to 
the FIFO queuing policy in each of the N 
constrained queues. 

2. For a packet p in Af, let d(p) denote its departure 
time from Af' . Then, the arriving packets in 
each of the M output queues are queued in the 
order of their respective departure times from 
Af' . More formally, in every output queue r 
packet p will be ahead of every packet p' 
satisfies d(p') > d(p). 

In each time slot t, the algorithm chooses 
feasible schedule S(t) from 5? using a 
criterion as follows: 




S(t) e argmax(?7-' 7 (r), 7r) 



(id 



VII. Throughput of MUCF(/) Algorithm 

The previous section described how we arrived at MUCF 
algorithm as a fair algorithm based on preferences obtained 
from a shadow CFN. As established in the previous section, 
Theorem 1 implies that MUCF is the only algorithm that 
satisfies the desirable postulates Pl'-P2'-P3'. In this section, 
we state and prove the throughput optimality property of 
the MUCF algorithm. The proof of the algorithm is non- 
traditional and requires new techniques that may be of interest 
for analysis of such non-queue based weighted algorithms. 

Theorem 2: Consider a constrained queuing system with an 
arbitrary set of constraints . Suppose the system is loaded 
with an i.i.d. Bernoulli arrival process, and is operating under 
the MUCF(/) algorithm with /(•) a bi-Lipschitz function. 
Then, if the rate vector is strictly admissible, the queuing 
network is rate stable. 

Before we prove theorem 2 we need the following notation 
and lemmas. 



Notation. First, some useful notation. Consider the HoL 
packet p T n of queue n in network Af at the beginning of the 
time slot r. As before, let a„(r) be its time of arrival and 
dn(r) be the time of its departure from Af', U n (r) be its 
urgency as defined above, and W n (r) be its waiting time (i.e. 
t — a„(r) if the queue is non-empty and if it is empty). Let 
Wl{r) denote / (W„(r)) and F(y) denote f(x)dx. Also, 
define A„(r) as W n {r) — XJ n (r). We note that if queue n is 
empty, then W n (r) = and U n (r) is as defined above. Hence, 



A„(t) is always non-negative. Let 



= 1,2,. ..,M, 



denote the length of the k th busy cycle at output queue m in 
Af'. Finally, for any function g : R — ► R and a vector v e R N , 
g(v) e denotes {g{v n )). 

Lemma 1: Let L(t) := (F(W(t)),X) 
E„ F {Wu{t)) A„. Then, under the MUCF(/) algorithm with 
/(•) a bi-Lipschitz function and A being strictly admissible, 
there exists an e > such that 

E [L(t + 1) - L(t)\ < -eE [\W(t)\] + 2E [|A(r)|] + K, 

for some constant K. 

Lemma 2: Let m (r) denote maxo<K T Bj,, for m = 
1,2, ... ,M. Then, under a strictly admissible A with the 
output queues of Af' operating under a Work Conserving 
Policy (WCP), the following is true for all t and 1 < m < M, 

E[6 m (r)] <0(logr) 

We will first prove the result of Theorem 2 assuming the 
results of Lemmas 1 and 2, and defer their proof until after 
the proof of Theorem 2. 

Proof of Theorem 2: We first note that if queue n is 
non-empty then 

A„(r) < max B k m (12) 

0<fc<T 

where m is the destination output queue of packet p T n . This 
is true because: A„(r) denotes the waiting time of p T n in its 
destination output queue in Af' , and hence cannot be more 
than the length of the busy cycle it belongs to. Since there 
can be at most r busy cycles up to time r and p T n arrived to 
Af' before r, (12) should be true. 

Therefore, from lemma 2 and (12) it follows that: 



E[A„(r)] <0(logr) 



(13) 



If queue n is empty, then by definition it follows that either 

A ra (r) = or A„(r) = d n '(r) — r, where queue n' is 
non-empty. Since, A;(t) > V/,r, we have from (13) that 
E[A„(t)] < E[A„/(r)] < O(logr). Hence, (13) is valid 
even for empty queues and it follows that: 

E[A(t)] < O(logr) (14) 

From lemma 1 and (14), we obtain the following: 

E [L(t + 1) - L(t)} < -eE [\W(t)\] + O(logr) + K, (15) 

Telescopic summation of (15) from 1, . . . , r, we obtain (after 
cancellations), 



E[L(r + l)] < E[L(0)] - eE 



i=i 



+ 0(t log t)+tK, (16) 
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Now, the network starts empty at time 0. Therefore, E [£(0)] = 
0. Further, L(-) is non-negative function. Therefore, (16) gives 
us 



eE 



i=i 



< O(nlogn) + nK. 



Dividing both sides by et log t, we obtain 
1 



E 



rlogT 



<0(1). 



(17) 



(18) 



Let X T = ±£[ =1 |W(*)| and Z r = i§f From (18), we 
have E \Z T \ < 0(1) < oo for all r. Now, we claim that 



Pr lim-|VF(T)| = =1. 



(19) 



The proof of (19) is presented later. Before that, we use it 
to complete the proof of rate stability of the algorithm. Now, 
at time t the waiting time of the HoL packet of queue n is 
W n (t). Under FIFO policy and due to at most one arrival per 
time slot, we have that the queue-size of queue n at time t, 
Qn(t) < W n (t). From (19), we have that 

lim = 0, with probability 1. (20) 

t^QO t 



Now, Q n (t) observes the following dynamics: 

Qn{t) = Qn{0) + E Mr) - D n (t), 



(21) 



T<t 



where the second term on RHS is the cumulative arrival to 
queue n till time t while the third term is the cumulative 
departure from queue n till time t. By strong law of large 
numbers (SLLN) for Bernoulli i.i.d. process we have that: 



lim 

t^oo t 



jE A «( r ) = A »- 



T<t 



Using this along with (20) and (21), we obtain 
DJt) 

lim = A„, with probability 1, Vn. 

t^oo t 

This completes the proof of Theorem 2 assuming claim (19). 
We prove this next. 

Suppose (19) is not true. Then, since |W(t)| > we have 
that for some 6 > 0, 



Pt(\W(t)\ > St, i.o.) > S, 



(22) 



where "i.o." means infinitely often. Now if |W(r)| > St, 
then there exists an HoL packet that has been waiting in 
the network for time at least St/N 2 . This is true because 
|W(t)| is the sum of weighting times of at most N 2 HoL 
packets. Call this packet p. This packet must have arrived at 
time < t — St/N 2 = r(l — SN~ 2 ). Since waiting time of a 
packet increases only by 1 each time-slot, the waiting time of 
the packet p must be at least 0.5St/N 2 in time interval [n, r], 
where n = r — 0.5<5t7V~ 2 = r(l — 0.5SN~ 2 ). Now, consider 
any time r' s [t\ , r] . The packet waiting at the HoL of the 
queue that contains p must have waiting time higher than that 
of p due to FIFO ordering policy. Therefore, the contribution 



to | W(t') | by HoL packets of the queue that contains packet 
p is at least 0.5<5r7V~ 2 . Therefore, we obtain 



E iwv) 



> 



5 2 t 2 
4iV 4 ' 



(23) 



Therefore, by the definition of X T and non-negativity of 
|W(-)|, we have the following logical implication: 



\W(t)\ >S T ^X m > 



S 2 i 



4iV 4 ' 

Thus, if (22) holds then by (24) we have 



Pr [X T > 



47V 4 ' 



o. > 8. 



(24) 



(25) 



Now observe the following relation of X t : since |W(-)| > 0, 



Hence, for a > 1, 



X tn > 1 - - 



(a-l)t 



(26) 



Since 



1 - 



(a-l)t 



exp (—a + 1) 



we obtain that for there is large enough t a such that for t > t a , 

for any t' G [t,1.5t) 



Xi.5t > 3^*'- 



(27) 



Define, Yf. — X 15 k for k > 0. Then, the following are direct 
implications of (27): for any 9 > 0, 

X T > 9t, i.o. =>- Y k > 9l.5 k /3, i.o.; 

Yfc > 36»1.5 fe , i.o. => X T > 9t, i.o.. 

Therefore, to complete the proof of (19) by contradicting 
(22), it is sufficient to show that for 9 = 3S (since N > 1), 

Pr (Y fc > 91.5 k , i.o. ) = 0. 

For this, let event E k = {Y k > 9l.h k }. Then, from E \Z t \ < 
0(1), relations Y k = X x ^ k , Z k = and Markov's 

inequality we obtain that 



Pr(£ fc ) < O 



1.5 fc 



Therefore, 



oo. 



k k 

Therefore, by Borel-Cantelli's Lemma, we have that 

Pr(£ fe i.o.) = 0. 

This completes the proof of (19) and that of Theorem 2. 
Proof of Lemma 1: Define the following: for all n 

W n (T+l)=W n (T) + l-p n S* n (T) 
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where S*(t) is the schedule of MUCF algorithm and f3 n 
is the inter-arrival time for the arrival process to queue n. 
When the queue is empty, treat (3 n as an independent r.v. 
without any meaning, while if queue is not empty then treat 
it as the inter-arrival time between the packet being served 
and the packet behind it. In either case, due to the FIFO 
policy j3 n is totally independent of the scheduling decisions 
performed by the algorithm till (and including) time r and 
the information utilized by the algorithm. Therefore, we will 
treat it as an independent random variable with Geometric 
distribution of parameter A„ (since arrival process is Bernoulli 
i.i.d.). Consider the following: for any r, 



F 



W n (r+1) 

"(W n (r + 1)^ -f(w„(t)) = J f(x)dx (28) 

W„(t) 

It is easy to see that, 



W„(t+1) 



l-/3„S;(r) 



J f{x)dx= J f(y + W n (r))dy (29) 

ff„(r) 

Since /(•) is non-decreasing bi-Lipschitz continuous with 
/(0) = 0, we have 

f(y + W n (r)) = f(y + W n (r)) - f {W n {r)) + f (W n (r)) 
< If (V + W n (r)) - f (W n (r))\ + W£(t) 
<p\y\ + Wf(r) (30) 



Now, it follows from (28), (29) and (30) that 

f(w„(t + 1)) -F(w n (T)) 
< p(l-(3 n S; i (T)) 2 + (l-f3 n S* n (T))Wf(T) 



Using (31) and the fact that [i n is a Geometric r.v. with mean 
1/A„, we have the following: 



E 



KF(w n {T + 1)) - KF(w n (r)) 

n n 

Y, Wi(r)X n - Y Wi(r)S* n (r) + p]T A„ 

n n n 

-2pY / S* n (r)+pY S n( T )K 1 



W(t) 



(32) 



Here, we have used the fact that S*(t) G {0, 1} and hence 
(S*(r)) 2 = S*(r). Using fact that £„ A„ < N, E„ S*(r) < 
N and A„ 1 < oo for all n such that A„ =/= 0, we obtain that 



E Y *nF(w n (T + 1)) - Y AnF(w n (r)) 

_ n n 

< (Wf(r),\-S*(T))+K, 



W{t) 
(33) 



where K is some large enough constant. Now, define S w (t) 



as 



S w (t) = argmaxV(l^ / (T),7r) . 

7reJ^ m ax „ 



That is, S w (t) is the maximum weight schedule with weight 
of queue n as W,((t). Consider the following: 

(Wf(T),X-S*(r)) 
= (W f (r), A - S w (t)) + (W f (r) - U f (T) 1 S w (r) - S*(t)) 
+ (UI(t),S w {t)-S*(t)). (34) 

From the definitions of S*(t),S w (t), A(t)(= W{t)-U{t)), 
and bi-Lipschitz continuity of /(•) it follows that 

{U*(T),Sr(r)-S*(T)) < (35) 
(Wf(T)-Uf(T),S w (T)-S*(r)) < p(A(r),l)(36) 

Now, for strictly admissible A, we obtain that for some 7 6 
(0,1), 

(Wf(T),X-S w (r)) 

= (Wf(T),J2^n k )^{Wf(T),S w (T)) 
k 

= ^a fc (^(T),7r fc )-(^(r),^(r)) (37) 

k 

Since S w (r) is the maximum weight schedule with weight of 
queue n as W„(r): 

(W f (T),n k ) < (W f (r),S w (T)) Vk (38) 

Thus, it follows from (37) and (38): 

(Wf(T),\-S w (T)) < - 1 {Wf(T),S w (r)). (39) 

Now, since all N entries can be covered by N distinct feasible 
schedules, it follows that the weight of maximum weight 
matching is at least 1/N the sum of the weights of all entries. 
That is, 



(31) (W f (T),S W (T))>^Y W n(T) 



N 

n 

\wHj) 

N 



> IF(t)| 



(40) 



P 



N 



The last inequality follows from the bi-Lipschitz continuity of 
/(•). Combining (33)-(40) and taking further expectation with 
respect to W(t), we obtain 



E 



(41) 



Y KF(w n {r + 1)) - Y A„F(V(r)) 

_ n n 

<-e^[\W(r)\]+pE[\A(r)\)+K, 

where e — jfc. To complete the proof, note that if queue 
n is non-empty after service at time n, then W n (r + 1) = 
W„(t+1). Else, W n (r+l) = 0. Therefore, F^„(r+l)J > 

F(w n (r + ifj. This inequality, along with (41), implies the 
desired claim of Lemma 1 . ■ 
Proof of Lemma 2: This result corresponds to a constraint- 
free network in which the scheduling at different output queues 
is independent. Hence, we will prove the result for a single 
queue operating under a WCP and strictly admissible loading. 
We will use the same notation, but with the subscript m's 
dropped. For a single queue operating under a WCP and 
strictly admissible loading, busy cycle lenghts form an i.i.d 
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process i.e., B k are i.i.d. We will now look at the large 
deviation behavior of this process. Denote the cumulative 
arrival process during the k th busy cycle by I(t). Then for 
large enough t, 

Pr (B k >t) < Pr (I(t) - t > 0) < Ccxp(-Dt) (42) 

where C and D are some non-negative constants. The last 
inequality follows from Cramer's theorem. Let Q denote the 
random variable maxfc< T B k . Then, we have the following: 

e [e] = Pr (@ > *) = E Pr ( e > *) + Pr ( e > *) 



t<r 



t>r 



<r 



t>r 



Pr (6 > t) 



(43) 



(43) is true for any non-negative integer T. In particular, 
choose r large enough such that (42) is true V£ > T. It now 
follows from union bound that 



E 



Pr (6 > i) < E Pr > *) 

fc<r t>r 
< 0(ncxp(-OT)) 



(44) 



The second inequality follows from (42). Now, by choosing 
r = O(logr) we can bound J2t>r ^ r (® > by ^ now 
follows from (43) that 



E 



maxfl 

k<r 



<0(logr) 



(45) 



VIII. Experiments 

We carried out simulations to compare the performance 
of our algorithm in the context of IQ switches. We as- 
sumed a FIFO queuing policy at the input ports of the IQ 
switch. We compare the performance of our algorithm with 
Longest Queue First (LQF) and Oldest Cell First (OCF). 
We used a fixed-length packet switch simulator available at 
http://klamath.stanford.edu/tools/SIM/. 

We first explain the simulation settings: The switch size 
is N = 16. The buffer sizes are infinite. The policy used is 
FIFO. All inputs are equally loaded on a normalized scale, and 
p € (0, 1) denotes the normalized load. The arrival process is 
Bernoulli i.i.d. We use a Uniform load matrix, i.e., Ay = 
p/N Mi,]. We ran our simulation for 2.1 million time steps 
removing the first 100,000 time steps to achieve steady-state. 

Because we are approaching this problem from the per- 
spective of fairness, we evaluate the aforementioned switching 
algorithms in terms of Latency and Output-Queue (OQ) Delay. 
OQ delay is defined as the difference of the departure times 
of a cell in the input queued switch and the shadow OQ 
switch. Further, the goal cannot only be to achieve a better 
expected latency, but in fact, we wish to value consistency, 
or relatively few deviations from the mean. One measure 
for this are higher moments of the variables. Thus, here we 
provide plots for the logarithm of first and second moments 
of both Latency and the OQ Delay versus a uniform load of 
p. 6 plot respectively the logarithm of the first and second 
moments of the latency. We observe that for lower loads, i.e., 




Fig. 5. Comparison of the logarithm of Expected latencies of different 
scheduling algorithms. 




a 13 



Fig. 6. Comparison of the logarithm of second moments of the latencies of 
different scheduling algorithms. 



for p < 0.45 the performance of all the three algorithms is 
almost the same. But for higher loads, the first moment of 
LQF and MUCF are better than OCF. Fig. 6 shows that in 
terms of the second moment, MUCF performs the best and 
LQF the worst, with OCF lying between. This is in line with 
our expectation because, as mentioned earlier LQF is not fair 
and hence performs badly at higher moments. MUCF performs 
better than OCF for both the moments. 

Figures 5 and 6 correspond to latency and figures 7 and 
8 correspond to OQ delay. We observe that MUCF performs 
better than the other two algorithms for both the metrics at 
all the loads, especially for the second moments illustrating 
the fairness. Thus, the simulations illustrate that MUCF better 
tracks the performance of an OQ switch than LQF and OCF. 
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Fig. 7. Comparison of the logarithm of Expected output queued delays of 
different scheduling algorithms. 
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Fig. 8. Comparison of the logarithm of second moments of the output queued 
delays of different scheduling algorithms. 

IX. Conclusion 

In this paper we considered the problem of designing a 
fair scheduling algorithm for constrained queuing systems. 
Fairness in networks is not only an intuitively desired goal 
but also one with many practical benefits. Most of the existing 
work concentrates on fairly allocating bandwidth to different 
flows in the network. A major limitation of this approach is 
that it disregards the packetized nature of flows. We overcome 
this problem and propose a packet based notion of fairness by 
establishing a novel analogy with the ranked election problem. 
Ranked election is a widely studied problem in the Economics 
literature and this analogy allows us to leverage that work. This 
results in a packet based notion of fairness and an algorithm 
to achieve this fairness. 

Rather surprisingly, the algorithm turns out be the famil- 
iar MW style algorithm. Moreover, it does not require the 



knowledge of flow arrival rates. Our fairness algorithm also 
fits into the utility maximization framework that is more 
popular for designing fair algorithms. This, in some sense, 
validates our approach. We also prove that our algorithm 
is throughput optimal. This result is very crucial since the 
emulation approach already achieves fairness, but with a loss 
of throughput. Also, the proof is non-trivial and requires some 
non-traditional techniques to be introduced because existing 
proof techniques don't directly apply. The proof techniques 
we introduce are more widely applicable to similar problems 
and this is another important contribution of the paper. Finally, 
our simulation results corroborate the fact that our algorithm is 
better at providing fairness than the more popular algorithms 
in the context of input queued switches. 
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